Processing a Multimedia Join through the Method of Nearest Neighbor Search (long version)
Authors
Abstract
This paper presents a multimedia join operator that is carried out through nearest neighbor search. In contrast to related approaches that utilize a similarity function to perform a join between two instances of the input tables, we adopt the more flexible and widely used nearest neighbor method. First, we introduce a simple nearest neighbor search algorithm based on a nested-loop execution strategy. Second, we propose an optimized version that takes advantage of query point clustering in a hypersphere. Several experiments demonstrate the efficiency of the optimized algorithm over the simple one for different datasets, data sizes, and dimensions.
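To make the nested-loop strategy mentioned in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes that each table row is represented by a fixed-length feature vector and that Euclidean distance is the dissimilarity measure. The names `euclidean` and `nn_join_nested_loop` are hypothetical, and the optimized hypersphere-clustering variant is not shown because the abstract does not describe it in detail.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def euclidean(a: Vector, b: Vector) -> float:
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nn_join_nested_loop(
    left: List[Vector], right: List[Vector]
) -> List[Tuple[int, int, float]]:
    """Pair every vector in `left` with its nearest neighbor in `right`.

    Returns (left_index, right_index, distance) triples. The outer loop
    runs over the query points and the inner loop scans the whole right
    table, so the cost is O(len(left) * len(right)) distance computations.
    """
    result = []
    for i, query in enumerate(left):
        best_j, best_d = -1, math.inf
        for j, candidate in enumerate(right):
            d = euclidean(query, candidate)
            if d < best_d:  # strict comparison keeps the first match on ties
                best_j, best_d = j, d
        result.append((i, best_j, best_d))
    return result


if __name__ == "__main__":
    # Assumed toy data: 2-dimensional feature vectors.
    left_table = [(0.0, 0.0), (5.0, 5.0)]
    right_table = [(1.0, 0.0), (4.0, 4.0), (10.0, 10.0)]
    for row in nn_join_nested_loop(left_table, right_table):
        print(row)
```

Every query point triggers a full scan of the right table, which is presumably the cost that an optimized version would reduce by processing clustered query points together.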
Similar resources
Processing a multimedia join through the method of nearest neighbor search
Commonly used content-retrieval systems focus on the problem of finding the nearest neighbor (NN search) for a given single query object out of a database of media objects [1]. However, there are only a few attempts [2,3] that realize join operations on two multimedia tables, where the multimedia data components are represented by their respective feature vectors. The necessity of using multimedia...
Supporting KDD Applications by the k-Nearest Neighbor Join
The similarity join has become an important database primitive to support similarity search and data mining. A similarity join combines two sets of complex objects such that the result contains all pairs of similar objects. Well-known are two types of the similarity join, the distance range join where the user defines a distance threshold for the join, and the closest point query or k-distance ...
Efficient index-based KNN join processing for high-dimensional data
In many advanced database applications (e.g., multimedia databases), data objects are transformed into high-dimensional points and manipulated in high-dimensional space. One of the most important but costly operations is the similarity join that combines similar points from multiple datasets. In this paper, we examine the problem of processing K-nearest neighbor similarity join (KNN join). KNN ...
High Performance Data Mining Using the Nearest Neighbor Join
The similarity join has become an important database primitive to support similarity search and data mining. A similarity join combines two sets of complex objects such that the result contains all pairs of similar objects. Well-known are two types of the similarity join, the distance range join where the user defines a distance threshold for the join, and the closest point query or k-distance ...
On Optimizing Nearest Neighbor Queries in High-Dimensional Spaces
Nearest-neighbor queries in high-dimensional space are of high importance in various applications, especially in content-based indexing of multimedia data. For an optimization of the query processing, accurate models for estimating the query processing costs are needed. In this paper, we propose a new cost model for nearest neighbor queries in high-dimensional space, which we apply to enhance t...